Improved model training and automatic weight adjustment for multi-SNR multi-band speaker identification system

نویسندگان

  • Kenichi Yoshida
  • Kazuyuki Takagi
  • Kazuhiko Ozeki
چکیده

In our previous paper, we presented a speaker identification system using a multi-SNR multi-band method, and reported its robustness against environmental noises. This paper describes two modifications to the system for further enhancement of its noise-robustness. Firstly, 1/f noise is employed instead of white Gaussian noise to make noisy data for training multi-SNR GMMs. Secondly, recombination weights for subband likelihood are automatically adjusted based on the estimated subband noise power. For performance evaluation, text-independent speaker identification experiments were conducted on test speech data created by mixing clean speech data with 5 kinds of environmental noises: “bus,” “car,” “office,” “lobby,” and “restaurant” at 0 and 10 dB SNRs. By the two modifications, the identification error rate was reduced by 30.3% on the average compared with the baseline multi-SNR multi-band method using white Gaussian noise and equal weights.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Stream-weight optimization by LDA and adaboost for multi-stream speaker verification

This paper proposes an automatic stream-weight optimization method for noise-robust speaker verification using multi-stream HMMs integrating spectral and prosodic information. The paper first shows the effectiveness of the multi-stream technique in our speaker verification framework. Next, a stream-weight adaptation method combining the linear discriminant analysis (LDA) and Adaboost techniques...

متن کامل

Noise Robust Speaker Identification Using Sub-Band Weighting in Multi-Band Approach

Recently, many techniques have been proposed to improve speaker identification in noise environments. Among these techniques, we consider the feature recombination technique for the multi-band approach in noise robust speaker identification. The conventional feature recombination technique is very effective in the band-limited noise condition, but in broad-band noise condition, the conventional...

متن کامل

MAP combination of multi-stream HMM or HMM/ANN experts

Automatic speech recognition (ASR) performance falls dramatically with the level of mismatch between training and test data. The human ability to recognise speech when a large proportion of frequencies are dominated by noise has inspired the “missing data” and “multi-band” approaches to noise robust ASR. “Missing data” ASR identifies low SNR spectral data in each data frame and then ignores it....

متن کامل

Design of Fuzzy Logic Based PI Controller for DFIG-based Wind Farm Aimed at Automatic Generation Control in an Interconnected Two Area Power System

This paper addresses the design procedure of a fuzzy logic-based adaptive approach for DFIGs to enhance automatic generation control (AGC) capabilities and provide better dynamic responses in multi-area power systems. In doing so, a proportional-integral (PI) controller is employed in DFIG structure to control the governor speed of wind turbine. At the first stage, the adjustable parameters of ...

متن کامل

Eurospeech 2001 -scandinavia a Multi-snr Subband Model for Speaker Identication under Noisy Environments

This paper describes a multi-SNR subband model for speaker identi cation under noisy environments. The model consists of a set of subband GMMs (Gaussian Mixture Models) trained on speech data corrupted with white Gaussian noise at several SNRs. In the recognition stage, an optimal GMM that yields the maximum accumulated likelihood on the whole input frames is selected for each subband. Then the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004